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Abstract — We propose a distributed optimization method for 
solving a distributed model predictive consensus problem. The 
goal is to design a distributed controller for a network of 
dynamical systems to optimize a coupled objective function 
while respecting state and input constraints. The distributed op- 
timization method is an augmented Lagrangian method called 
the Alternating Direction Method of Multipliers (ADMM), 
which was introduced in the 1970s but has seen a recent 
resurgence in the context of dramatic increases in computing 
power and the development of widely available distributed com- 
puting platforms. The method is applied to position and velocity 
consensus in a network of double integrators. We find that a few 
tens of ADMM iterations yield closed-loop performance near 
what is achieved by solving the optimization problem centrally. 
Furthermore, the use of recent code generation techniques for 
solving local subproblems yields fast overall computation times. 

I. Introduction 

One of the grand challenges facing modern society is the 
understanding and efficient control of large, complex tech- 
nological networks, in which many dynamical subsystems 
interact. Such networks include power grids, the Internet, 
transportation networks, and unmanned vehicle formations. 
From a control perspective, one of the major difficulties is 
that there are structural constraints on information flow: each 
subsystem, or agent, in the network must act based on lim- 
ited information about the whole network, and furthermore, 
even if agents could gather global information, determining 
optimal inputs may be computationally prohibitive. 

In this paper, we consider an optimal consensus problem, 
in which a network of dynamically independent subsystems 
locally cooperate to optimize a global performance objective. 
There is a large and expanding literature on consensus 
problems; see e.g. [1], [2] and references therein for an 
overview. However, much less attention has been given to 
optimally achieving consensus in networks of dynamical 
systems, particularly when the subsystems are subject to state 
and input constraints. 

The existing literature on optimal consensus problems 
typically propose variations on distributed model predictive 
control. Model predictive control (MPC) is a well-developed 
and successful method for controlling systems with state 
and input constraints. In MPC, a model of the plant is 
used to predict future behavior of the system. At each 
measurement instant, an optimization problem is solved to 
choose a sequence of inputs that optimize the predicted 
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future behavior over a finite horizon. Only the first input 
in the sequence is applied to the system, and the process is 
repeated when new measurements are received. In distributed 
model predictive control schemes, each subsystem solves an 
optimization problem using only its own state and local state 
information measured or communicated from neighboring 
subsystems. 

Seminal work on distributed model predictive consensus 
includes [3]-[5]. In [4] and [5], the global optimization 
problem is broken into smaller problems for each agent, and 
sufficient conditions for stability are given. However, there 
is only a single communication of information between each 
state update rather than an iterative negotiation process, so 
these methods rely on neighboring agents not deviating too 
much from their plans and as a result are very conservative. 
Other related work on optimal consensus problems includes 
[6]— [13]. In [12], consensus schemes for systems with single 
and double integrator dynamics are considered. Geometric 
properties of the optimal path followed by individual agents 
are used to guarantee stability. Again, there is no iterative 
negotiation process, so the results are very conservative. In 
[7], [9], [10], the authors use an incremental subgradient 
method based on primal decomposition to negotiate an opti- 
mal consensus point. A stability analysis for these methods 
is given in [9]. There has also been a significant amount of 
work on distributed model predictive control for dynamically 
coupled systems, including [14]-[22]. 

Much of the work on distributed model predictive control 
and consensus focuses on deriving conditions that guarantee 
stability, but often requires extremely conservative assump- 
tions. Moreover, while several different distributed optimiza- 
tion algorithms and variations thereof have been proposed to 
iteratively negotiate the solution to the optimization problem, 
the performance of these algorithms in distributed MPC 
problems remains poorly understood, especially in terms of 
the closed-loop performance of the policy. 

In this paper, we propose to use the Alternating Direction 
Method of Multipliers (ADMM) for solving a distributed 
model predictive consensus problem. ADMM is an aug- 
mented Lagrangian method originally proposed in the 1970s 
in [23] and [24], and recently reviewed in [25]. It can be 
seen as a variation of dual decomposition that provides 
improved theoretical and practical convergence properties. 
The main advantages of using this method are: (1) in theory, 
convergence is guaranteed for any convex cost functions and 
constraints, and (2), in practice, the augmented Lagrangian 
term often speeds up convergence. We apply the method 
to a flocking problem in a network of double integrators 



and find that a few tens of ADMM iterations yield closed- 
loop performance near what is achieved by solving the 
optimization problem centrally. We also utilize recent code 
generation techniques for convex optimization [26], [27] for 
solving the local subproblems, thereby achieving surprisingly 
fast overall computation times. 

The rest of the paper is organized as follows. Section 2 
formulates a distributed model predictive consensus prob- 
lem. Section 3 reviews the Alternating Direction Method of 
Multipliers and describes how it can be used to solve the 
distributed model predictive consensus problem. Section 4 
demonstrates the results with a numerical example. Section 5 
gives concluding remarks and an outlook for future research. 

II. Distributed Model Predictive Consensus 
A. An Optimal Consensus Problem 

We consider a network of N independent agents, who are 
required to cooperatively solve an optimal control problem 
in a distributed way. The agents are allowed to communicate 
information according to a fixed undirectecQ graph G — 
(V,E), which we refer to as the information architecture. 
The vertex set V = {1,...,N} represents the agents and 
the edge set E C V x V specifies pairs of agents that are 
permitted to communicate. If G E, we say that agents i 
and j are neighbors, and we denote by A/* = {j : € E} 
the set of neighbors of agent i. 

The dynamics for the ith agent are given by the discrete- 
time linear state equation 



x i (t + l)=A i Xi(t) + B i u i (t), i = l, 



,N, 



(1) 



where Xi(t) <E R Mi is the state, Ui(t) 6 R mi is the control 
input, and Ai e R" iX "' and B { g R"' xm ' are given 
dynamics and input matrices. The global system state and 
input are concatenations of local states and inputs: x(t) = 
[x 1 (t) T ,...,x N {t) T ] T and u{t) = [u 1 {t) T ,...,u N {t) T ] T . 
The objective is to minimize the infinite horizon cost function 



oo N 



(2) 



t=0 i=l 



where ^»(v) 816 convex, closed, and proper stage cost 
functions and x^it) and u^it) are concatenations of the 
state and input vectors of agent i and its neighbors. The 
stage cost functions couple the independent agents according 
to G. Each agent is subject to convex local state and input 
constraints 

xjs/ z (t) G Xi ujs/ z (t) e Ui, 

where X{ and Ui are convex subsets that (possibly) couple 
the states and inputs of neighboring agents. The global 
constraints are X C R^; ,li and U C R^; mi . In general, 
we seek a distributed optimal policy 7r : X — > U whose 
structure is specified by the information architecture, i.e. the 
policy for agent i should depend only on information from 
its neighbors in G. 

'Directed graphs are easily incorporated into our framework, but we 
restrict attention to undirected graphs to simplify the exposition. 



B. Model Predictive Control 

We employ a model predictive control scheme, in which a 
model is used to predict future behavior of the system, and 
the inputs are chosen to optimize a performance index over 
a finite horizon. The infinite horizon optimization problem 
Q is replaced by a finite-horizon optimization problem that 
is solved at each time instant in a receding horizon fashion. 
Only the first input of the optimizer is applied to the system; 
the problem is solved again whenever state information is 
received. The finite-horizon optimization problem is 

T-l JV 

minimize 

t=0 i=l 
N 

+ EM 2; ^(t i W 1 (t)) 

<=* 0) 
subject to Xi(t + 1) = AiXi(t) + BiUi(t), 

x^t) e Xi, u^ t (t) e Ui, 

i = l,...,N, i = 0,1,...,T- 1 
x^(T)eX tf i = l,...,N, 

where £;/(•) and Xif are terminal cost functions and terminal 
constraint sets, respectively. For centralized problems, there 
are various methods for choosing the terminal cost functions 
and constraint sets to ensure feasibility and stability of the 
closed-loop system; see e.g. [28]. However, for distributed 
problems, these quantities should be chosen to reflect the 
information architecture, i.e. the terminal cost and constraint 
sets for agent i should only depend on xj^ t . Recent work has 
considered the problem of choosing terminal costs and con- 
straint sets with a certain distributed structure for feasibility 
and stability [21]. 

C. Consistency Constraint Form 

All coupling variables in the problem can be reduced to so- 
called consistency constraints by introducing local copies of 
variables at each subsystem and requiring the local copies of 
coupling variables to be the same across coupled subsystems. 
Specifically, let be a local variable vector for agent i that 
includes a copy of the state and input vectors over the finite 
horizon and also copies of the state and input vectors over 
the finite horizon of all neighbors. For example, if agent 1 
is connected to agent 2, then X! includes a local copy of X\, 
Mi, X2, and ii2 over the finite horizon. Likewise, for agent 
2, the variable X2 contains a separate local copy of x\, u\, 
X2, and U2 over the finite horizon. Let x = [x^,...,x^] T 
be a vector that includes all copies of all variables in the 
problem. The problem then has the form 



minimize 



N 



i=l 



E,-x, 



(4) 



1. 



,N, 



subject to Xi 

where E, is a matrix that picks out components of x that 
match components of the local variable x, (to simplify 
notation, there may be some redundant coupling constraints 
as written, but we assume that redundant coupling constraints 



are removed). The constraint set Xj is convex and includes 
all constraints for agent i and all of its neighbors. The 
problem (|4]) now has a separable cost function in variables 
x.j with only coupling equality constraints, which enforce 
consistency of local variable copies. 

Our focus is on using a distributed optimization method to 
solve Q (hence, ([3])) by distributing the computation across 
a collection of communicating processors. Several different 
distributed optimization methods have been proposed for 
solving finite-horizon optimization problem arising from 
distributed model predictive control and consensus problems 
[3]-[22]. Many of them are based on some version of 
standard dual decomposition, in which a partial Lagrangian 
is formed by adding the coupling equality constraints to 
the objective function. However, these methods provably 
converge only under strong assumptions (e.g. that the cost 
function is strictly convex) and may also be slow to converge 
in practice. The Alternating Direction Method of Multipliers 
(ADMM) is an augmented Lagrangian method that guaran- 
tees convergence under very mild assumptions and also has 
been observed to improve convergence speed in practice. We 
briefly review this method in the following section. 

III. The Alternating Direction Method of 
Multipliers 

ADMM was originally proposed in the 1970s in [23] 
and [24], but has seen a recent resurgence in the context 
of increased computational power and distributed computing 
platforms. An excellent recent overview of the method with 
applications to statistics and machine learning is provided in 
[25]; the reader is referred there for details. In this section, 
we briefly review the highlights. 

A. Dual Decomposition 

Consider the optimization problem 



minimize /(x) = ^/;(x; 

X " i— 1 

subject to Ax = b 



(5) 



where Xj € R ni are subvectors of the global variable 
x = [xf , ...,x^] T and A is partitioned comformably A = 
L4i ■••Aff], Note that (j4j) has the same form as problem 
d5ji, with A — diag(J ni ) — E, where E 
and 6 = 0. The Lagrangian for problem <j5j is 



[Ef, ...,Ej ! ] T 



N 



L(x,A) = ^(/ i (x i ) + A T A i x i 

i=l 

N 



N 



X T b) 



(6) 



where A are Lagrange multipliers, or dual variables, and the 
corresponding dual function is 



Dual decomposition involves applying gradient ascent to 
maximize the dual function, yielding the algorithm 



xf +1 = argmin L i {x i ,X k ) 
A fc+1 = \ k +a k (Ax k+1 -b) 



(8) 



where k is an iteration counter and a k is a step size. Since 
the Lagrangian is separable in the partition of x, the x« min- 
imizations in the first line of the algorithm can be performed 
independently and in parallel for all i = 1 , . . . , N for a given 
A fe . The residuals AiX k+1 — bi are then gathered to update 
the dual variable in the second line of the algorithm. When 
the coupling constraint matrix A has particular structure, as 
in problem Q, the dual update need not be performed by a 
centralized entity, but rather can be done locally by sharing 
information only amongst coupled subsystems. 

While dual decomposition yields a distributed algorithm 
for solving problem {3), it is only guaranteed to converge for 
suitably chosen step size sequences and under rather strong 
assumptions on the original problem. One way to improve 
the convergence guarantees is to augment the Lagrangian 
with an additional penalty term, which we discuss next. 

B. The Method of Multipliers 

The augmented Lagrangian includes an additional penalty 
term: 

L p (x, A) = /(x) + X T (Ax - b) + %\\Ax - b\\l (9) 

where p > is called the penalty parameter, and can be seen 
as the unaugmented Lagrangian for the problem 



minimize 



/(x) 



\Ax-b\ 



subject to Ax = b, 



(10) 



which has the same minimizer as problem (|5j. The penalty 
term has a smoothing effect that renders the dual function 
differentiable under mild conditions on the primal problem. 
The gradient ascent algorithm on the dual of ( 10 1 con- 



<?(A)=infZ(x, A). 



(7) 



verges under very mild conditions on the primal problem. 
In particular, the primal objective function need not be 
strictly convex, finite, or even differentiable. However, while 
the convergence properties have improved, the quadratic 
penalty term prevents the x minimization in the dual ascent 
algorithm from being separable, and therefore cannot be used 
to distribute the computation across a set of processors, even 
when the objective function / is separable. 

C. ADMM 

ADMM can be viewed as a way to combine the robustness 
of the method of multipliers with the decompos ability of 
standard dual decomposition. We now focus specifically on 
coupling constraints of the form given in Q, which results 
in a particular form of ADMM that is referred to in [25] 
as "general form global consensus". A global variable z is 



minimize 

Xi£Xj 



JV 

E. 

i=l 



(ID 



introduced, and the algorithm solves problems of the form 

subject to Xj — E^z = 0, £ = 1, N. 
The augmented Lagrangian for problem (JTTJ is 

iv 

L p (x,z,X) =y^[/t(xi) 

i=l 

+Af(x 4 -E 4 z) 
= ^pi(xi, z, A) 



fN-E^||i 



(12) 



The ADMM algorithm consists of the iterations 
xf +1 = argmin L pi (x l7 z k ,X k ) 

x,6X, 

z k+1 = argmin L p (x fc+1 , z, A fe ) 



(13) 



A 



fe+i 



Af + p(x 4 fc+1 -E l z fc + 1 ) 

The method of multipliers for problem ( [TT] ) would jointly 
minimize over x and z, whereas here the algorithm alter- 
nates sequentially between x and z minimizations, which 
allows the x^ minimizations to be done in parallel. In this 
particular form, the z minimization also distributes across its 
components and can be simplified to 



v fe+i 



1 , 



(14) 



where (x k )i and {\ k )i denote the components of x 3 and Xj 
that map to the ith component of z. It can be shown that 
after one iteration, the local average of the dual variables for 
each component of z, viz. J^jeAf- C^jO*> * s zero - Hence, 
the z update further reduces to 

il + i .4^ . J>1 ' 



(15) 



i.e. the z update is simply a local averaging of all "opinions" 
of each agent on the particular component of z (see [25]). 

Algorithm 1 Alternating Direction Method of Multipliers 
1: Vz in parallel: 
2: initialize A = 0, z = 
3: repeat 



k+l 



argmin L p (xi, z k , X k ) 



X;6X; 

communicate x+ to all j e A/"j 



v fe+i 



Af +1 =\* + p(xf +1 -E 4 z fe + 1 ) 
until convergence or maximum iterations reached 



ADMM is a distributed message passing algorithm. In the 
context of the distributed model predictive control problem, 
the algorithm works as follows. Each agent solves a local 



subproblem to minimize its local cost function, which com- 
putes a local plan for both its own state and input and also for 
states and inputs of neighboring agents over the horizon. The 
solution of this problem (step 1 in ( fT3~| l) can be interpreted 
as the agent's own "opinion" on what it and its neighbors 
should do to minimize its local cost. Neighboring agents then 
communicate their plans to one another. The global variable 
is updated (step 2 in ( 13 1) by averaging the local plans. The 



13 i) based on the 



dual variables are then updated (step 3 in 
difference between the local plans and the averaged global 
plan. The whole iterative negotiation process takes place 
within each sampling period, every time a state measurement 
is obtained. 

At each iteration of the algorithm, each subsystem solves 
a small (relative to the global problem), structured convex 
optimization problem. Very recently, code generation tech- 
niques for convex optimization have been developed [26], 
[27]. By exploiting problem structure, these techniques can 
enable extremely fast computation times for small convex 
optimization problems (on the order of microseconds to mil- 
liseconds, depending on problem size and processor capabili- 
ties). ADMM breaks a large optimization problem into many 
smaller optimization problems that can be spread across mul- 
tiple processors. Combining code generation techniques with 
a distributed optimization method like ADMM potentially 
allows solving large optimization problems surprisingly fast. 

The distributed model predictive control literature often 
lacks evaluations of proposed distributed optimization al- 
gorithms in terms of computation times and performance 
comparison to centralized schemes. In the next section, we 
explore the practical performance of ADMM on a distributed 
model predictive consensus problem. 



IV. Numerical Example 

In this section, we apply the method to a consensus 
problem for a collection of double integrators moving in 
three-dimensional space with input constraints. The state 
of each agent G R 6 consists of spatial positions (the 
first, third, and fifth components) and velocities (the second, 
fourth, and sixth components). The input ui £ R 3 consists 
of force components acting on the agent. After an Euler 
discretization with step size T s , the dynamics and input 
matrices for each agent are 



A, = h 



1 T s 
1 



B, = h 





T s /rm 



(16) 



where ® is the Kronecker product and mi is the mass of 
agent i. We consider input constraints only: Xi = R 6 and 



Ui = {u G R 



< U r . 



We consider a flocking problem, in which the objective is 
for all agents to converge to a common position and velocity. 
One possible stage cost function that encodes this is 

£(x(t),u(t)) =x(t) T (I 3 <S>(L®I 2 ))x(t)+u(t) T Ru(t), (17) 



where L is the weighted graph Laplacian matri^ and R is 
a positive-definite matrix. In the simulations in the section, 
we use R — I3 (g) Jjv. We also include process noise that 
acts independently on each component of the acceleration 
of each agent and has zero-mean and covariance 0.1. We 
consider N = 5 agents connected in a simple path graph. 
The prediction horizon is T = 10. 

To evaluate the method, we generated random initial 
conditions and random noise sequences and ran a closed- 
loop simulation for 250 steps. At each step, the agents obtain 
a state measurement and communicate their state to their 
neighbors. Then the agents commence the iterative ADMM 
algorithm described in the previous section, communicating 
the intermediate results of local optimizations to neighbors 
and locally updating global variables and Lagrange multi- 
pliers. We fixed the maximum number of ADMM iterations 
to 30 and compared the closed-loop performance with the 
performance achieved when solving the finite-horizon model 
predictive control problem centrally. 

Figures 1-3 show example state and input time trajectories 
for each spatial component for a randomly generated initial 
condition and noise sequence. The solid blue curve shows the 
trajectory using ADMM in the closed-loop, and the dashed 
red curve shows the trajectory with the finite horizon MPC 
problem solved with a centralized solver. Figure 4 shows a 
spatial plot of the 5 agents moving through space. We can 
see that both methods achieve consensus, but converge to 
slightly different points on the consensus manifold. Figure 5 
shows a comparison of closed-loop performance (expressed 
as percentage of total cost) achieved by a centralized solver 
with the performance of ADMM with various maximum 
number of iterations. For each maximum iteration number, 
we randomly generated 120 initial conditions and noise 
sequences, ran a 250-step closed-loop simulation, and av- 
eraged the results. If only 1 iteration is performed, the 
performance is about 80% worse; however, for this example, 
using just 2 iterations achieves performance within 1.5% 
of the centralized solver and using 10 or more achieves 
performance within 0.5% (on average). 

We used C code generated from CVXGEN [26] to solve 
the local subproblems. Using a 3.1 GHz Intel Core i5 
processor, the local subproblems are solved in under 2 
milliseconds; thus, if the local subproblems were solved in 
parallel, the total computation time to finish 30 negotiations 
to compute the optimal inputs at each sampling time would 
be under 60 milliseconds. There is evidence that ADMM 
scales very well with problem size; a computational study 
in the context of distributed model predictive control is 
performed in [22], and in [29], a dynamic energy network 
scheduling problem with 10 million variables was reported 
to be capable of being solved in a parallel implementation 

2 The weighted graph Laplacian matrix L S ~R. NxN for a graph G = 
(V, E) is defined as follows. We first define the weighted adjacency matrix 
A £ ~R, NxN , which encodes the communicated state information flow. The 
diagonal entries of A are zero, and an off-diagonal entry Hij is positive 
if agents i and j are neighbors. Then the Laplacian matrix is defined by 
— y " i ^ij and l^j — Q'ij ■ 
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Fig. 1. x component time trajectories. 
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Fig. 2. y component time trajectories. 

in under 1 second. Thus, there is potential to solve very 
large problems surprisingly fast by combining distributed 
optimization methods like ADMM with fast custom solvers 
for local subproblems. 

V. Conclusions 

In this paper, we proposed the Alternating Direction 
Method of Multipliers (ADMM), which has good theoretical 
and practical convergence properties, for solving a distributed 
model predictive consensus problem. A numerical example 
illustrates that near-centralized performance can be achieved 
with only a few tens of iterations of the distributed method. 

There are many potential directions for future research. On 
the theoretical side, while ADMM has good theoretical and 
practical convergence properties, the consistency constraints 
of the iterates are only feasible in the limit. So guaranteeing 
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Fig. 4. Spatial plot of agent trajectories. 
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Fig. 5. Closed-loop performance comparison. 

closed-loop stability for early stopping is an open problem 
and will require arguments different from the standard ones 
used in model predictive control. Communication network 
non-idealities, e.g. delays and packet drops, are important 
and may dominate computation times in a distributed im- 
plementation, and should be considered in future research. 
Finally, there are many potential applications for distributed 
model predictive control and consensus, e.g. to vehicle 
formations and power networks. 
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